Experimental Validation of the Clustering by Compression Technique
نویسندگان
چکیده
În zilele noastre, oamenii se confruntă cu o cerere din ce în ce mai mare de cunoştinţe şi informaţii. În acest context, clasificarea datelor este esenţială pentru obţinerea de informaţii structurate ca răspuns la interogările utilizatorilor. În această lucrare vom evalua rezultatele produse de o nouă tehnică de clasificare – clasificarea prin compresie atunci când se aplică asupra unor seturi diferite de date. Procedeul de clasificare prin compresie se bazează pe o distanţă universală de similitudine, numită distanţă normală de compresie sau NCD, calculată pe baza dimensiunii fişierelor de date comprimate. Rezultatele experimentale arată că se pot clasifica corect fişiere de diferite tipuri, fără nici o informaţie prealabilă. NCD a dovedit capacitatea de a evalua distanţa dintre obiectele de diferite tipuri, prin aproximarea distanţei normale de informaţie (NID), o metrică universală, care există doar la nivel teoretic.
منابع مشابه
A Clustering Approach by SSPCO Optimization Algorithm Based on Chaotic Initial Population
Assigning a set of objects to groups such that objects in one group or cluster are more similar to each other than the other clusters’ objects is the main task of clustering analysis. SSPCO optimization algorithm is anew optimization algorithm that is inspired by the behavior of a type of bird called see-see partridge. One of the things that smart algorithms are applied to solve is the problem ...
متن کاملAn Improved SSPCO Optimization Algorithm for Solve of the Clustering Problem
Swarm Intelligence (SI) is an innovative artificial intelligence technique for solving complex optimization problems. Data clustering is the process of grouping data into a number of clusters. The goal of data clustering is to make the data in the same cluster share a high degree of similarity while being very dissimilar to data from other clusters. Clustering algorithms have been applied to a ...
متن کاملAn Improved SSPCO Optimization Algorithm for Solve of the Clustering Problem
Swarm Intelligence (SI) is an innovative artificial intelligence technique for solving complex optimization problems. Data clustering is the process of grouping data into a number of clusters. The goal of data clustering is to make the data in the same cluster share a high degree of similarity while being very dissimilar to data from other clusters. Clustering algorithms have been applied to a ...
متن کاملA Comparative Study between a Pseudo-Forward Equation (PFE) and Intelligence Methods for the Characterization of the North Sea Reservoir
This paper presents a comparative study between three versions of adaptive neuro-fuzzy inference system (ANFIS) algorithms and a pseudo-forward equation (PFE) to characterize the North Sea reservoir (F3 block) based on seismic data. According to the statistical studies, four attributes (energy, envelope, spectral decomposition and similarity) are known to be useful as fundamental attributes in ...
متن کاملWater Quality Zoning of Rivers by the Technique of Fuzzy Clustering Analysis
Zoning the pollution of a river may be the first or even the most important step in water quality management. In order to resolve its pollution, fuzzy clustering analysis may be used whenever a composite classification of water quality incorporates mutiple parameters
 
In such cases, the technique may be used as a complement or an alternative to comprehensive assessment. In fuzzy cluster...
متن کاملWater Quality Zoning of Rivers by the Technique of Fuzzy Clustering Analysis
Zoning the pollution of a river may be the first or even the most important step in water quality management. In order to resolve its pollution, fuzzy clustering analysis may be used whenever a composite classification of water quality incorporates mutiple parameters In such cases, the technique may be used as a complement or an alternative to comprehensive assessment. In fuzzy clustering ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011